Table 1. Speech Recognition Performances (alphabet Recognition)

نویسنده

  • Patrick Haffner
چکیده

Highly structured artificial neural networks have been shown to be superior to fully connected networks for realworld applications like speech recognition and handwritten character recognition. These structured networks can be optimized in many ways, and have to be optimized for optimal performance. This makes the manual optimization very timeconsuming. A highly structured approach is the Multi State Time Delay Neural Network (MSTDNN) which uses shifted input windows and allows the recognition of sequences of ordered events that have to be observed jointly. In this paper we propose an Automatic Structure Optimization (ASO) algorithm and apply it to MSTDNN type networks. The ASO algorithm optimizes all relevant parameters of MSTDNNs automatically and was successfully tested with three different tasks and varying amounts of training data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

A Comparative Study of Multilayer Feed-forward Neural Network and Radial Basis Function Neural Network Models for Speech Recognition

The most common way of human-to-human communication is speech. As speech provides the easiest and most natural way of interaction, it becomes the need of human-to-machine communication as well. Automatic speech recognition (ASR) is the technology to enable machines to understand process and recognize speech. Due to its applicability in various application domains, ASR becomes one of the most fa...

متن کامل

Speech Recognition using the Epochwise Back Propagation through time Algorithm

In this paper, the artificial neural networks are implemented to accomplish the English alphabet speech recognition. The design an accurate and effective speech recognition system is a challenging task in the area of speech recognition. We implemented a new data classification method, where we use neural networks, which are trained and performance can be defined on the basis of recognition rate...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

English Alphabet Recognition Based on Chinese Acoustic Modeling

How to effectively recognize English letters spoken by Chinese people is our major concern in the paper. Some efforts are made to build Chinese extended Initial/Final (XIF) based HMMs for English alphabet recognition which can be integrated with large vocabulary continuous Chinese speech recognition (Chinese LVCSR) system based on a same XIF set. The alphabet-specific XIF HMMs are built using c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993